neil armstrong
Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources
Lupidi, Alisia, Gemmell, Carlos, Cancedda, Nicola, Dwivedi-Yu, Jane, Weston, Jason, Foerster, Jakob, Raileanu, Roberta, Lomeli, Maria
Large Language Models still struggle in challenging scenarios that leverage structured data, complex reasoning, or tool usage. In this paper, we propose Source2Synth: a new method that can be used for teaching LLMs new skills without relying on costly human annotations. Source2Synth takes as input a custom data source and produces synthetic data points with intermediate reasoning steps grounded in real-world sources. Source2Synth improves the dataset quality by discarding low-quality generations based on their answerability. We demonstrate the generality of this approach by applying it to two challenging domains: we test reasoning abilities in multi-hop question answering (MHQA), and tool usage in tabular question answering (TQA). Our method improves performance by 25.51% for TQA on WikiSQL and 22.57% for MHQA on HotPotQA compared to the fine-tuned baselines.
- North America > United States (0.47)
- North America > Mexico (0.04)
- North America > Canada (0.04)
- (5 more...)
Apollo's Oracle: Retrieval-Augmented Reasoning in Multi-Agent Debates
Wang, Haotian, Du, Xiyuan, Yu, Weijiang, Chen, Qianglong, Zhu, Kun, Chu, Zheng, Yan, Lian, Guan, Yi
Multi-agent debate systems are designed to derive accurate and consistent conclusions through adversarial interactions among agents. However, these systems often encounter challenges due to cognitive constraints, manifesting as (1) agents' obstinate adherence to incorrect viewpoints and (2) their propensity to abandon correct viewpoints. These issues are primarily responsible for the ineffectiveness of such debates. Addressing the challenge of cognitive constraints, we introduce a novel framework, the Multi-Agent Debate with Retrieval Augmented (MADRA). MADRA incorporates retrieval of prior knowledge into the debate process, effectively breaking cognitive constraints and enhancing the agents' reasoning capabilities. Furthermore, we have developed a self-selection module within this framework, enabling agents to autonomously select pertinent evidence, thereby minimizing the impact of irrelevant or noisy data. We have comprehensively tested and analyzed MADRA across six diverse datasets. The experimental results demonstrate that our approach significantly enhances performance across various tasks, proving the effectiveness of our proposed method.
- North America > United States > Illinois > Cook County > Chicago (0.05)
- North America > United States > Massachusetts > Suffolk County > Boston (0.04)
- North America > United States > New York (0.04)
- (3 more...)
- Government > Space Agency (0.72)
- Government > Regional Government > North America Government > United States Government (0.72)
- Transportation > Air (0.69)
- Leisure & Entertainment > Sports (0.46)
DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain Question Answering over Knowledge Base and Text
Zhao, Wenting, Liu, Ye, Niu, Tong, Wan, Yao, Yu, Philip S., Joty, Shafiq, Zhou, Yingbo, Yavuz, Semih
Large Language Models (LLMs) have exhibited impressive generation capabilities, but they suffer from hallucinations when solely relying on their internal knowledge, especially when answering questions that require less commonly known information. Retrieval-augmented LLMs have emerged as a potential solution to ground LLMs in external knowledge. Nonetheless, recent approaches have primarily emphasized retrieval from unstructured text corpora, owing to its seamless integration into prompts. When using structured data such as knowledge graphs, most methods simplify it into natural text, neglecting the underlying structures. Moreover, a significant gap in the current landscape is the absence of a realistic benchmark for evaluating the effectiveness of grounding LLMs on heterogeneous knowledge sources (e.g., knowledge base and text). To fill this gap, we have curated a comprehensive dataset that poses two unique challenges: (1) Two-hop multi-source questions that require retrieving information from both open-domain structured and unstructured knowledge sources; retrieving information from structured knowledge sources is a critical component in correctly answering the questions. (2) The generation of symbolic queries (e.g., SPARQL for Wikidata) is a key requirement, which adds another layer of challenge. Our dataset is created using a combination of automatic generation through predefined reasoning chains and human annotation. We also introduce a novel approach that leverages multiple retrieval tools, including text passage retrieval and symbolic language-assisted retrieval. Our model outperforms previous approaches by a significant margin, demonstrating its effectiveness in addressing the above-mentioned reasoning challenges.
- Europe > United Kingdom > England > Greater London > London > Wimbledon (0.05)
- North America > Canada > Ontario > Toronto (0.04)
- Africa > Rwanda > Kigali > Kigali (0.04)
- (13 more...)
- Leisure & Entertainment (1.00)
- Government > Space Agency (1.00)
- Government > Regional Government > North America Government > United States Government (0.93)
Prompt to GPT-3: Step-by-Step Thinking Instructions for Humor Generation
Chen, Yuetian, Shi, Bowen, Si, Mei
Artificial intelligence has made significant progress in natural language processing, with models like GPT-3 demonstrating impressive capabilities. However, these models still have limitations when it comes to complex tasks that require an understanding of the user, such as mastering human comedy writing strategies. This paper explores humor generation using GPT-3 by modeling human comedy writing theory and leveraging step-by-step thinking instructions. In addition, we explore the role of cognitive distance in creating humor.
- Asia > Middle East > UAE (0.17)
- North America > United States > New York > Rensselaer County > Troy (0.04)
- North America > Costa Rica > San José Province > San José (0.04)
- North America > United States (0.36)
- Antarctica (0.05)
- Government > Space Agency (0.54)
- Government > Regional Government > North America Government > United States Government (0.36)
How NASA's spacesuits have changed through the years
They began as tinfoil-like onesies that first took humanity beyond the confines of Earth. But NASA's spacesuits have evolved dramatically through the years to help astronauts survive in the vacuum of space and ultimately allow a man to hop, skip and jump on the moon. Now, the US space agency has revealed its next generation of spacesuit -- expected to be ready for the Artemis III mission that could land the first woman and first person of colour on the lunar surface by 2025. The new suit is said to be a better fit for female space travellers and comes with specialist features to support astronauts as they carry out scientific experiments on the moon. But what were the various spacesuits that came before it?
- North America > United States > Florida > Brevard County > Cape Canaveral (0.05)
- Asia > Middle East > Jordan (0.05)
- Government > Space Agency (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)
The ChatGPT bot is causing panic now – but it'll soon be as mundane a tool as Excel John Naughton
So the ChatGPT language processing model burst upon an astonished world and the air was rent by squeals of delight and cries of outrage or lamentation. The delighted ones were those transfixed by discovering that a machine could apparently carry out a written commission competently. The outrage was triggered by fears of redundancy on the part of people whose employment requires the ability to write workmanlike prose. And the lamentations came from earnest folks (many of them teachers at various levels) whose day jobs involve grading essays hitherto written by students. If we know anything from history, it is that we generally overestimate the short-term impact of new communication technologies, while grossly underestimating their long-term implications.
- North America > United States (0.17)
- Europe > United Kingdom (0.05)
60th anniversary of JFK's 1961 speech to land on the moon in spotlight as NASA returns in 2024
On May 25, 1961, President John F. Kennedy delivered a 46-minute speech that included historical context of the Cold War and how the US planned to triumph over the Soviets, but what won the hearts of the American people was his plan to send humans to the moon. 'Space is open to us now; and our eagerness to share its meaning is not governed by the efforts of others. We go into space because whatever mankind must undertake, free men must fully share...,' the late president said while standing behind the lectern during a joint session of Congress. 'First, I believe that this nation should commit itself to achieving the goal, before this decade is out, of landing a man on the moon and returning him safely to the earth.' The US had not even sent a human into orbit at the time of the speech, which placed it far behind the Soviets who had sent an astronaut to space a month before Kennedy addressed the nation.
- North America > Cuba (0.29)
- North America > United States > Texas > Harris County > Houston (0.16)
- Asia (0.15)
- (2 more...)
- Government > Space Agency (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)
A Deeper Look into AI and Space: Benefits and Challenges
On July 20, 1969, Neil Armstrong became the first person to walk on the moon. After almost a decade of designing, building, testing, and hypothesizing, we had succeeded -- a man was finally on the moon. Since then we have expanded our sights not only to the moon, but to planets, other moons, and asteroids. Modern space exploration is a vast field of possibilities beyond the limits of Earth's atmosphere where we can increase our knowledge of the cosmos and benefit humanity. As humans, we seek to answer the fundamental questions of our purpose in the universe: Is there life outside of Earth? What are we doing here? Where do we come from? To answer these seemingly endless questions, we look to space, an infinite?
A colony of 'avatar robots' will replace astronauts and operate a long-term base on the MOON
Russia is developing a moon base which will be operated with remote-controlled avatars, according to the country's space boss. Dmitry Rogozin, head of Roscosmos, has laid out plans to put robotic avatars on our natural satellite and have them operated by people on Earth. He claimed this endeavour is more ambitious than the iconic US'Apollo' programme of the '60s and '70s. 'This is about creating a long-term base, naturally, not habitable, but visited. But basically, it is the transition to robotic systems, to avatars that will solve tasks on the Moon surface,' Mr Rogozin said.
- Asia > Russia (0.39)
- North America > United States > Florida > Brevard County (0.05)
- Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.05)
- (2 more...)
- Government > Space Agency (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)